Ensure ivf postings lists are in docID order #129655

benwtrent · 2025-06-18T17:05:26Z

This PR is pretty basic. Right now we don't enforce any ordering at all for our IVF postings lists.

It seems like we should at a minimum make sure they are in doc-id order.

If we decide to switch this in the future, at least we will have a consistent ordering.

elasticsearchmachine · 2025-06-18T17:05:50Z

Pinging @elastic/es-search-relevance (Team:Search Relevance)

benwtrent · 2025-06-18T17:24:25Z

I did some benchmarking: it doesn't give us much space savings (yet), but it doesn't hurt performance either.

iverase · 2025-06-18T17:35:31Z

The idea behind not sorting by docId was to keep the non-SOAR vectors together, so that bulk scoring is more effective.

benwtrent · 2025-07-01T20:15:48Z

I need to benchmark this in highly filtered scenarios (e.g. when we search more centroids) to ensure this doesn't hurt search performance.

benwtrent · 2025-07-07T19:14:36Z

I ran some higher-recall filtered search scenarios and there is basically zero increase in query latency.

baseline:

index_name                      index_type  num_docs  index_time(ms)  force_merge_time(ms)  num_segments
------------------------------  ----------  --------  --------------  --------------------  ------------
cohere-wikipedia-docs-768d.vec         ivf   2000000          160346                243206             0
cohere-wikipedia-docs-768d.vec         ivf   2000000               0                     0             0
cohere-wikipedia-docs-768d.vec         ivf   2000000               0                     0             0

index_name                      index_type  n_probe  latency(ms)  net_cpu_time(ms)  avg_cpu_count     QPS  recall   visited  filter_selectivity
------------------------------  ----------  -------  -----------  ----------------  -------------  ------  ------  --------  ------------------
cohere-wikipedia-docs-768d.vec         ivf      200         3.68              0.00           0.00  271.74    0.94  46351.83                1.00
cohere-wikipedia-docs-768d.vec         ivf      200         3.69              0.00           0.00  271.00    0.94  46351.83                1.00
cohere-wikipedia-docs-768d.vec         ivf      200         3.65              0.00           0.00  273.97    0.94  46351.83                1.00
cohere-wikipedia-docs-768d.vec         ivf      200         3.68              0.00           0.00  271.74    0.94  46351.83                1.00
cohere-wikipedia-docs-768d.vec         ivf      200         3.68              0.00           0.00  271.74    0.94  46351.83                1.00
cohere-wikipedia-docs-768d.vec         ivf      200         6.26              0.00           0.00  159.74    0.94  29174.39                0.40
cohere-wikipedia-docs-768d.vec         ivf      200         5.19              0.00           0.00  192.68    0.94  29174.39                0.40
cohere-wikipedia-docs-768d.vec         ivf      200         5.19              0.00           0.00  192.68    0.94  29174.39                0.40
cohere-wikipedia-docs-768d.vec         ivf      200         5.48              0.00           0.00  182.48    0.94  29174.39                0.40
cohere-wikipedia-docs-768d.vec         ivf      200         5.28              0.00           0.00  189.39    0.94  29174.39                0.40
cohere-wikipedia-docs-768d.vec         ivf      200         5.23              0.00           0.00  191.20    0.94  12030.14                0.10
cohere-wikipedia-docs-768d.vec         ivf      200         5.18              0.00           0.00  193.05    0.94  12030.14                0.10
cohere-wikipedia-docs-768d.vec         ivf      200         5.13              0.00           0.00  194.93    0.94  12030.14                0.10
cohere-wikipedia-docs-768d.vec         ivf      200         5.56              0.00           0.00  179.86    0.94  12030.14                0.10
cohere-wikipedia-docs-768d.vec         ivf      200         5.03              0.00           0.00  198.81    0.94  12030.14                0.10

this PR:

index_name                      index_type  num_docs  index_time(ms)  force_merge_time(ms)  num_segments
------------------------------  ----------  --------  --------------  --------------------  ------------
cohere-wikipedia-docs-768d.vec         ivf   2000000          154027                237642             0
cohere-wikipedia-docs-768d.vec         ivf   2000000               0                     0             0
cohere-wikipedia-docs-768d.vec         ivf   2000000               0                     0             0

index_name                      index_type  n_probe  latency(ms)  net_cpu_time(ms)  avg_cpu_count     QPS  recall   visited  filter_selectivity
------------------------------  ----------  -------  -----------  ----------------  -------------  ------  ------  --------  ------------------
cohere-wikipedia-docs-768d.vec         ivf      200         3.80              0.00           0.00  263.16    0.94  46351.83                1.00
cohere-wikipedia-docs-768d.vec         ivf      200         3.75              0.00           0.00  266.67    0.94  46351.83                1.00
cohere-wikipedia-docs-768d.vec         ivf      200         3.96              0.00           0.00  252.53    0.94  46351.83                1.00
cohere-wikipedia-docs-768d.vec         ivf      200         3.68              0.00           0.00  271.74    0.94  46351.83                1.00
cohere-wikipedia-docs-768d.vec         ivf      200         3.70              0.00           0.00  270.27    0.94  46351.83                1.00
cohere-wikipedia-docs-768d.vec         ivf      200         5.34              0.00           0.00  187.27    0.94  29174.39                0.40
cohere-wikipedia-docs-768d.vec         ivf      200         5.17              0.00           0.00  193.42    0.94  29174.39                0.40
cohere-wikipedia-docs-768d.vec         ivf      200         5.21              0.00           0.00  191.94    0.94  29174.39                0.40
cohere-wikipedia-docs-768d.vec         ivf      200         5.18              0.00           0.00  193.05    0.94  29174.39                0.40
cohere-wikipedia-docs-768d.vec         ivf      200         5.17              0.00           0.00  193.42    0.94  29174.39                0.40
cohere-wikipedia-docs-768d.vec         ivf      200         5.10              0.00           0.00  196.08    0.94  12030.14                0.10
cohere-wikipedia-docs-768d.vec         ivf      200         5.07              0.00           0.00  197.24    0.94  12030.14                0.10
cohere-wikipedia-docs-768d.vec         ivf      200         5.17              0.00           0.00  193.42    0.94  12030.14                0.10
cohere-wikipedia-docs-768d.vec         ivf      200         5.08              0.00           0.00  196.85    0.94  12030.14                0.10
cohere-wikipedia-docs-768d.vec         ivf      200         5.08              0.00           0.00  196.85    0.94  12030.14                0.10
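
As a rough way to read the two tables, the latency column can be averaged per filter selectivity and compared. The sketch below does only that arithmetic; the class and method names are illustrative, the numbers are copied from the tables above, and treating the five rows per selectivity as repeated runs of the same query set is an assumption.

```java
import java.util.Arrays;

final class LatencySummary {
    // Arithmetic mean of a set of latency samples, in milliseconds.
    static double mean(double[] samplesMs) {
        return Arrays.stream(samplesMs).average().orElse(Double.NaN);
    }

    // Relative change of the candidate versus the baseline, in percent.
    static double percentChange(double baselineMs, double candidateMs) {
        return (candidateMs - baselineMs) / baselineMs * 100.0;
    }

    public static void main(String[] args) {
        double[] selectivity = {1.00, 0.40, 0.10};
        // latency(ms) columns copied from the "baseline" and "this PR" tables.
        double[][] baseline = {
            {3.68, 3.69, 3.65, 3.68, 3.68},
            {6.26, 5.19, 5.19, 5.48, 5.28},
            {5.23, 5.18, 5.13, 5.56, 5.03},
        };
        double[][] thisPr = {
            {3.80, 3.75, 3.96, 3.68, 3.70},
            {5.34, 5.17, 5.21, 5.18, 5.17},
            {5.10, 5.07, 5.17, 5.08, 5.08},
        };
        for (int i = 0; i < selectivity.length; i++) {
            double b = mean(baseline[i]);
            double p = mean(thisPr[i]);
            System.out.printf(
                "selectivity=%.2f baseline=%.3fms thisPR=%.3fms change=%+.1f%%%n",
                selectivity[i], b, p, percentChange(b, p));
        }
    }
}
```

With so few samples (and one 6.26 ms outlier in the baseline at selectivity 0.40), differences of a few percent in either direction are probably within noise.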

benwtrent · 2025-07-16T20:43:36Z

server/src/main/java/org/elasticsearch/index/codec/vectors/DefaultIVFVectorsWriter.java

            // keeping them in the same file indicates we pull the entire file into cache
-            docIdsWriter.writeDocIds(j -> floatVectorValues.ordToDoc(cluster[j]), size, postingsOutput);
+            postingsOutput.writeGroupVInts(docIds, size);
+            postingsOutput.writeGroupVInts(spillDocIds, overspillCluster.length);
+            onHeapQuantizedVectors.reset(centroid, size, j -> cluster[finalOrds[j]]);
+            bulkWriter.writeVectors(onHeapQuantizedVectors);
+            // write overspill vectors
+            onHeapQuantizedVectors.reset(centroid, overspillCluster.length, j -> overspillCluster[finalSpillOrds[j]]);
            bulkWriter.writeVectors(onHeapQuantizedVectors);


Is this sort of what you had in mind, @iverase?

Yes. My idea is that we can use this information to first score the primary assignments of all the clusters we want to visit. Those posting lists are guaranteed to be unique, so the visiting logic is simpler (and faster). Later we visit the spill assignments, where we need more complex (slower) logic to skip docs that were already visited. The downside of this approach is that it requires more hops across the posting list files, which somewhat breaks the disk-friendly approach of this type of index.

Do you think this is doable? It complicates the search logic quite a bit, and maybe the benefits are too small. What do you think?
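
A minimal sketch of the two-phase visiting order described above, not the actual reader code. `Cluster`, `visit`, and the returned list of "scored" doc IDs are hypothetical stand-ins; the sketch assumes each doc has exactly one primary assignment, so the first phase can skip the duplicate check.

```java
import java.util.ArrayList;
import java.util.BitSet;
import java.util.List;

final class TwoPhaseVisitSketch {
    // Hypothetical view of one posting list: primary and spill doc IDs kept apart.
    static final class Cluster {
        final int[] primaryDocIds;
        final int[] spillDocIds;

        Cluster(int[] primaryDocIds, int[] spillDocIds) {
            this.primaryDocIds = primaryDocIds;
            this.spillDocIds = spillDocIds;
        }
    }

    // Returns doc IDs in the order they would be scored.
    static List<Integer> visit(List<Cluster> clusters, int maxDoc) {
        BitSet visited = new BitSet(maxDoc);
        List<Integer> scored = new ArrayList<>();
        // Phase 1: primary assignments. Assumed unique across clusters, so
        // there is no membership check, only bookkeeping for phase 2.
        for (Cluster c : clusters) {
            for (int doc : c.primaryDocIds) {
                visited.set(doc);
                scored.add(doc);
            }
        }
        // Phase 2: spill assignments. A doc may already have been scored via
        // its primary cluster, so each one is checked first.
        for (Cluster c : clusters) {
            for (int doc : c.spillDocIds) {
                if (!visited.get(doc)) {
                    visited.set(doc);
                    scored.add(doc);
                }
            }
        }
        return scored;
    }

    public static void main(String[] args) {
        List<Cluster> clusters = List.of(
            new Cluster(new int[] {1, 4, 7}, new int[] {2, 9}),
            new Cluster(new int[] {2, 5}, new int[] {4, 10}));
        System.out.println(visit(clusters, 16));
    }
}
```

The second loop is where the extra hops come from: every cluster's posting list is touched twice instead of once.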

@iverase the main benefit of SOAR and overspilling in general is that a lower nProbe is needed. I would expect us to score both regular and overspill up to some fraction of nProbe.

I am not saying that we would not score both, but that we would do one after the other, so that if there are no deletes, scoring the unique posting lists would be faster (no need to process docIds before scoring them). I can see it gets hairy, and it would require different logic branches, which is not great.

If you see no performance impact, I think we can just order all docs. We can make the distinction later on if it is ever required.

So, all the vectors are already in doc order. But when we combine the initial and secondary assignments into one grouping, you only get a partial ordering.

Each assignment array is already in vector ordinal order, which also means it is already in doc ID order.

This PR now just keeps them separate (no sorting required).
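
A small illustration of the point above, using made-up doc IDs rather than anything from the writer: each assignment array is ascending on its own, but concatenating the two into one grouping is not, so writing them as two separate runs preserves order without a sort. The class and helper names are hypothetical.

```java
import java.util.Arrays;

final class PostingOrderSketch {
    // True when the doc IDs are in strictly ascending order.
    static boolean isAscending(int[] docIds) {
        for (int i = 1; i < docIds.length; i++) {
            if (docIds[i - 1] >= docIds[i]) {
                return false;
            }
        }
        return true;
    }

    // Appends b after a, mimicking "one grouping" of both assignment kinds.
    static int[] concat(int[] a, int[] b) {
        int[] out = Arrays.copyOf(a, a.length + b.length);
        System.arraycopy(b, 0, out, a.length, b.length);
        return out;
    }

    public static void main(String[] args) {
        int[] primary = {2, 9, 17, 40}; // made-up primary assignments of one centroid
        int[] spill = {5, 11, 33};      // made-up overspill assignments of the same centroid
        System.out.println("primary ascending:  " + isAscending(primary));
        System.out.println("spill ascending:    " + isAscending(spill));
        System.out.println("combined ascending: " + isAscending(concat(primary, spill)));
    }
}
```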

Overall, there don't seem to be significant performance gains.

I noticed slightly lower performance when no filters are provided.

I noticed higher performance when very restrictive filters are used with a high nprobe, but I don't really know why.

I wouldn't expect doc ID decoding to be a significant issue.

I am gonna leave this as it is and we can revisit it at a later time, unless you have a better intuition around this.

benwtrent · 2025-08-13T21:50:38Z

done elsewhere

Ensure ivf postings lists are in docID order

98e00ee

benwtrent added the >non-issue, :Search Relevance/Vectors, and v9.1.0 labels Jun 18, 2025

elasticsearchmachine added the Team:Search Relevance label Jun 18, 2025

benwtrent mentioned this pull request Jun 18, 2025

Can we switch to the regular postings list writer/reader for IVF? #129654

Open

elasticsearchmachine added v9.2.0 and removed v9.1.0 labels Jun 26, 2025

Merge remote-tracking branch 'upstream/main' into ivf/ensure-doc-id-order

2f577e6

benwtrent added 5 commits July 9, 2025 10:51

Merge branch 'main' into ivf/ensure-doc-id-order

4bdfb5f

Merge remote-tracking branch 'upstream/main' into ivf/ensure-doc-id-order

5d71e41

iter

c32ca50

iter testing some things

42b0c71

trying something crazy, separating overspill vectors to their own blocks

3eb747e

benwtrent commented Jul 16, 2025

elasticsearchmachine and others added 10 commits July 16, 2025 20:53

[CI] Auto commit changes from spotless

4d4e311

iter

d848f79

Merge branch 'ivf/ensure-doc-id-order' of github.com:benwtrent/elasticsearch into ivf/ensure-doc-id-order

4832243

Merge branch 'main' into ivf/ensure-doc-id-order

8eb0d16

iter

d2eea2c

Merge remote-tracking branch 'upstream/main' into ivf/ensure-doc-id-order

0cfc246

iter

bb8dcfc

iter

f6caf36

ensure doc id sorting keep soar separate from nearest

e32ab82

fixing filtered search

c5e8661

benwtrent closed this Aug 13, 2025
